Data Mining Opportunities In Very Large

نویسنده

  • Kamalakar Karlapalem
چکیده

Information overload can hamper extracting hidden knowledge from a database. Data mining techniques ooer automated exploratory data analysis of databases. The mining process reveals knowledge buried in the data and provides insights into these data. Conventional mining techniques such as neural nets, regression analysis, and the discovering of rules uncover hidden information from user-provided training examples. A user indicates several of objects which are of interest to him (positive training examples) and few objects which are of no interest to him (negative training examples). The system then automatically discovers all interesting objects. In this position paper we present data mining opportunities that arise in very large object oriented databases, wherein the users no longer need to provide training examples. Data mining is achieved by user set priorities, user speciied guessed queries, and the notion of complete known information (i.e. information closure) about an object. From these inputs, hidden knowledge buried among huge amounts of information can be extracted in a user-driven manner.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Statistical Themes and Lessons for Data

Data mining is on the interface of Computer Science and Statistics, utilizing advances in both disciplines to make progress in extractinginformationfrom large databases. It is an emerging eld that has attracted much attention in a very short period of time. This article highlights some statistical themes and lessons that are directly relevant to data mining and attempts to identify opportunitie...

متن کامل

A Probabilistic Bayesian Classifier Approach for Breast Cancer Diagnosis and Prognosis

Basically, medical diagnosis problems are the most effective component of treatment policies. Recently, significant advances have been formed in medical diagnosis fields using data mining techniques. Data mining or Knowledge Discovery is searching large databases to discover patterns and evaluate the probability of next occurrences. In this paper, Bayesian Classifier is used as a Non-linear dat...

متن کامل

A Probabilistic Bayesian Classifier Approach for Breast Cancer Diagnosis and Prognosis

Basically, medical diagnosis problems are the most effective component of treatment policies. Recently, significant advances have been formed in medical diagnosis fields using data mining techniques. Data mining or Knowledge Discovery is searching large databases to discover patterns and evaluate the probability of next occurrences. In this paper, Bayesian Classifier is used as a Non-linear dat...

متن کامل

Computational Challenges in Data Mining

Data mining is applied in business to find new market opportunities from data stored in operational data bases which are used for dayto-day management. The tools applied combine ideas from statistics, machine learning, data base technology and high performance computing to find nuggets of knowledge. Data mining is also applied in science for example to find taxonomies of variable stars and in t...

متن کامل

New Opportunities in Marketing Data Mining

INTRODUCTION Data mining has been widely applied in many areas over the past two decades. In marketing, many firms collect large amount of customer data to understand their needs and predict their future behavior. This chapter discusses some of the key data mining problems in marketing and provides solutions and research opportunities.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007